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Abstract. The purpose of this Note is twofold: First, we introduce the gen- 
eral formalism of evolutionary genetics dynamics involving fitnesses, under 
both the deterministic and stochastic setups, and chiefly in discrete-time. In 
the process, we particularize it to a one-parameter model where only a selection 
parameter is unknown. Then and in a parallel manner, we discuss the esti- 
mation problems of the selection parameter based on a single-generation fre- 
quency distribution shift under both deterministic and stochastic evolutionary 
dynamics. In the stochastics, we consider both the celebrated Wright-Fisher 
and Moran models. 
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1. Introduction and outline 

In this Note, we revisit the basics of both the deterministic and stochastic dynamics 
arising in discrete-time evolutionary genetics (EG). We start with the haploid case 
with K alleles before switching to the more tricky diploid case. In the course of 
the exposition, we shall focus on a particular one-parameter selection instance of 
the general fitness model for which only the selection parameter is assumed to be 
unknown. 

Let us summarize and comment the content of Section 2. In the deterministic 
haploid case, the updates of the allele frequency distributions are driven by the 
relative fitnesses of the alleles, ending up in a state where only the fittest will 
survive. From the dynamics, it appears that the mean fitness increases as time 
passes by, the rate of increase being the variance in relative fitness. This constitutes 
the core of the Fisher theorem of natural selection (FTNS) . Introducing a discrete 
version of the Fisher information on time brought about by the allelic frequencies, 
it follows that one can identify this Fisher information with the variance in relative 
fitness. 

In the deterministic diploid case, there is a similar updating dynamics but now on 
the full array of the genotype frequencies. When mating is random so that the 
Hardy- Weinberg law applies, we may look at the induced marginal allelic frequen- 
cies dynamics. It follows that the induced closed-form allelic updating dynamics 
looks quite similar to the one occurring in the haploid case except that the mean 
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fitness now is a quadratic form in the current frequencies whereas marginal fitnesses 
no longer are constant but affine functions in these frequencies. In this context, 
the FTNS still holds true but, as a result of the fitness landscape being more com- 
plex, there is a possibility for a polymorphic equilibrium state to emerge. A short 
incursion in the continuous-time setting shows that here again one can identify the 
variance in relative fitness to a familiar Fisher information on time brought about 
by the frequencies. If one rather looks at the partial rate of increase of the mean 
fitness, one can identify it with half the allelic variance in relative fitness which 
is that component of the full genotypic variance in relative fitness which can be 
explained additively by the alleles constituting the genotypes, [3]. The remaining 
interaction part of this decomposition of the full genotypic variance can naturally 
be attributed to the dominance relationships between the alleles. In the interpre- 
tation of some authors, including W. J. Ewens ([3], p. 64 — 67), this last property 
based on the partial rate of increase rather constitutes the essence of the FTNS. 
Using this circle of ideas, it follows that looking at the partial rate of increase of 
the mean fitness also makes sense when dealing with the full array of the genotype 
frequencies, no matter what form of mating is at stake. It also involves an allelic 
variance in relative fitness. In each case, we keep looking at the incarnation of these 
results when dealing with the one-parameter selection model. So far the results in- 
troduced and discussed can be found to be classical, our own contribution being 
perhaps to put things in order and fix the notations and formalism in a clear way. 
An excellent introduction to these and related problems can be found in [7|. We 
believe that the following developments can be considered as being entirely new. 

To end up with Section 2, we discuss the estimation problem of the selection param- 
eter based on both the current and shifted allele frequencies observations. Similarly, 
we discuss this problem when the observable is a general scalar output of the current 
frequency distribution. When looking at the updating of this output, we encounter 
a particular incarnation of the Price equation, [1] . 

Section 3 is devoted to the stochastic version of these considerations when the 
transitions in the constitutive allelic population sizes are given by a K— dimensional 
Wright-Fisher model with total constant-size (see [3] and [8]). We show that the 
mean of the increment of the random absolute mean fitness is positive, whereas 
its rate of increase differs from its variance. We suggest that when the size of the 
total allelic population goes to infinity, one should recover part of the marginal 
deterministic theory. In the selection example, we compute the classical Fisher 
information on the selection parameter and exhibit its possible use in the estimation 
problem. We finally present some comparative issues pertaining to a related model 
of fundamental importance in the context of stochastic EG: the Moran model. 

Lots remain to be done in the same spirit, in particular including mutations and 
considering the multi-loci case with recombination. We again emphasize that in 
our models, there are no mutations included. 

2. EG THEORY: THE DETERMINISTIC POINT OF VIEW 

We start with the haploid case before moving to the diploid case, see ([3] and [7] 
for similar concerns). 
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2.1. Single locus: haploid population with K alleles. Consider K alleles Ak, 
k = 1, K attached to a single locus. Suppose the current time-t allelic frequency 
distribution on the K— simplex Sk is given by Xk, k = 1, ...,K. Let x := Xk, k = 
1, ...,K stand for the column-vectoiQ of these frequencies with |x| := J2k Xk = 
Let Wk > 0, k = 1, K be the absolute fitness of allele Ak- Let 

(1) w fc x) = — — 

w (x) 

be the relative fitness of allele Ak where w (x) := wixi represents the mean 
fitness of the population at time t. We shall also let 

K 

(2) cr 2 (x) = J^x fc Ofc - w (x)) 2 

fc=i 

stand for the variance in absolute fitness and 

K 

(3) a 2 (x) = £ x fe (w k (x) - l) 2 = a 2 (x) /«, (x) 2 

fe=i 

will be the variance in relative fitness. 

From the deterministic EG point of view, the discrete-time update of the allele 
frequency distribution on the simplex Sk is given bjQ 



(4) 4=p fe (x), k = l,...,K. 

where pk (x) := XkWk (x) . The quantity Wk (x) — 1 interprets as the frequency- 
dependent Malthus growth rate parameter of Xk- 

The vector p (x) := pk (x), k = 1, ■■■ 1 K, maps Sk into Sk- In vector form, with 
w(x) := Wk (x), fc = 1, K and D x :=diag(a;fe, k = 1, X), the nonlinear deter- 
ministic EG dynamics reads: 

x' = p (x) = £> x w (x) = Z? w(x) x, 

or, with Ax := x' — x, the increment of x 

Ax = (l>w(x) - ^) x. 

Avoiding the trivial case where fitnesses are all equal, without loss of generality, we 
can assume that either wi > ... > wk — 1 or w\ < ... < wk = L Thus allele Ai 
or Ak has largest fitness. The deterministic EG dynamics attains an equilibrium 
where only the fittest will survive. The equilibrium is an extremal state of the 
boundary of Sk- 



Example (selection). In general, all the Wk are unknown but sometimes, the set 
of unknowns can be reduced to 1 as follows: Let s > — 1 stand for a selection param- 
eter. Let ak, k = 1, K stand for a known [0, 1] —valued decreasing sequence with 
a\ = 1, or- = and assume Wk — 1 + sak- The fitness landscape is w (x) = 1 + sa (x) 
where a (x) := akXk- A possible choice of ak is ak = (K — k) / (K — 1) leading 
to equally spaced fitnesses with Wk+i — Wk = —s/ (K — 1) . An alternative choice is 

4n the sequel, a boldface variable, say x, will represent a column-vector so that its transpose, 
say x*, will be a line- vector. 

2 The symbol ' is a common and useful notation to denote the updated frequency 
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a k = (K/k -1) / (K-l) with w fe+ i - w k = -^=i k (k+i) - Depending on s > or 
s < 0, the unit fitness 1 is either the minimal or the maximal value of the ordered 
WkS. Although this particular model does not cover the class of all possible fitnesses, 
its generality is sufficient for our purposes and allows a considerable simplification 
of the exposition which otherwise would become tedious. It does not alter the gen- 
eral line of thought in a major way. ♦ 



According to the EG dynamical system (U|), for each k, the relative fitness decreases 
as time passes by. Indeed, with AWfe (x) := Wk (x') — Wk (x) 

Aw k (x) = w k (L>w(x)Xj - w k (x) = — j- ^ — 

w (% x) x) w (x) 

w k w k 



J2iwiwi{x)xi J2i w i x i 



< 



because wi (x) = ^j^y and w (x) 2 < Yli w i x l- However, unless the equilibrium 
state is attained, the absolute mean fitness w (x) increases: 



Aw (x) = w (x') — w (x) = Wk^Xk 

k 



= V w k x k (w k (x) - f ) = ^ k *"*f k -w(x)> 0. 

w (x) 

k 

The mean fitness is maximal at equilibrium. The rate of increase of w (x) is: 

r*\ Aio(x) r-s ._ . . 2 t-^ (Ax k ) 2 
(5) = > x k (w k x) - I) = > 

which is the variance in relative fitness a 2 (x) defined in ([3]) . These last two facts 
are sometimes termed the 1930s Fisher fundamental theorem of natural selection 
(FTNS). 



Remarks. 

(i) The expression appearing in the right-hand side of (O is also 

The discrete frequency distribution x depends on the time parameter t 6 {0, 1, 2, ...} 

which is itself discrete. The quantity I x (t) := J2 k x k {~^f^j ma y therefore be 
interpreted as a discrete version of the Fisher information about t brought by x. 
From (O , we get that the rate of increase of the mean fitness (which is the variance 
in relative fitness) identifies with this Fisher information 

(6) ^g)^ ^ 

w (x) 

(ii) When w k = 1 + sa-k a $ in the selection example, the variance in relative fitness 
reads 

° 2 ( x ) = ( 1 i S i \ ) Y\x k {a k -a (x)) 2 . ♦ 
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2.2. Single locus: diploid population with K alleles. We now run into similar 
considerations but with diploid populations whose genetical information governing 
their developments is carried by pairs of chromosomes. When considering the es- 
timation problem, to avoid overburden notations that would blur the exposition, 
we shall limit ourselves to the special one-parameter fitness model where a single 
selection parameter s is unknown. Under this hypothesis, the estimation problem 
is over-simplified because it avoids estimating the full fitness array that would lead 
to additional notational and technical difficulties due to multidimensionality. 

Joint EG dynamics. Let w k .i > 0, k, I = 1, K stand for the absolute fitness of 
the genotypes A k Ai attached to a single locus. Assume Wk,i = w^k- Let W be the 
symmetric fitness matrix with k,l— entry w k ,i- Assume the current frequency dis- 
tribution at time t of the genotypes A k Ai is given by x ky i- Let X be the frequencies 
array with k, I— entry Xk,i- The joint EG dynamics in the diploid case is given by 
the updating: 

(7) x ^ =Xk ^ — 

where the mean fitness w now is given by: w (X) = ; Xk,iWk,i- Define the relative 
fitness of the genotype A k Ai by: Wk,i (X) := ^nq and let W (X) be the matrix 
with entries Wk,i (X). Then the joint EG dynamics takes the matrix form: 

X' = X o W (X) = W (X) o X 
where o stands for the (commutative) Hadamard product of matrices. 
Let J be the K x K flat matrix whose entries are all 1. Then 

AX := X' - X = (X - J) oW(X) =W(X) o(X -J). 
We shall also let 

K 

(8) a 2 (X)=J2^k,i(wk,i-w(X)f 

k,l=l 

stand for the genotypic variance in absolute fitness and 

K 

(9) W 2 (X) = £ x k ,i (W ktl (X) - l) 2 = a 2 (X) /w (X) 2 

k,l=l 

will stand for the diploid variance in relative fitness. 

Consider the problem of evaluating the increase of the mean fitness. We have 



(10) Aw (X) = V Ax k ,iw k ,i = V x Kl — k 4- - w kd \=w(X) g 2 (X) > 

kt kj J 



with a relative rate of increase: Aw (X) /w (X) = a 2 (X) . This is the full diploid 
version of the FTNS. 



Marginal allelic dynamics. Assuming a Hardy- Weinberg equilibrium, the fre- 
quency distribution at time t, say x k ,i, of the genotypes A k Ai is given by: Xk,i = 
x k xi where x k — x kt i is the marginal frequency of allele A k in the whole geno- 
typic population. The whole frequency information is now enclosed within x := x k , 
k = 1, K. For instance, the mean fitness w now is given by the quadratic form: 
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w (x) = '^2 k j XkXiWk,i = x*T4 / x with x* the transposed line vector of the column 
vector x = XI (1 the unit K- vector). We shall also let 

K 

(11) a 2 (x) = ^2 x kXi (wk,i - w (x)) 2 

k,l=i 

stand for the genotypic variance in absolute fitness and 

K 

a 2 (x) = ^2 x k x i i w k,i (x) - l) 2 = a 2 (x) /w (x) 2 
fe,i=i 

will stand for the diploid variance in relative fitness with Wk.i (x) := Wk,i/w (x) the 
relative fitnesses. These quantities may now simply be indexed by x. 

Before we come to the diploid marginal EG dynamics, let us make the following 
remarks. Let 

K 

(12) S 2 (a) := 22 x k x i ( w k,i -w(K)-a k - at) . 

k,l=l 

The values of a minimizing S 2 (a) are easily seen to be a* = a k = uik (x) — w (x) , 
k = 1, ...,K. We shall let 

K 

(13) a 2 D (x) := S 2 (a*) = 2J x k x i (w k ,i - (w k (x) + wi (x) - w (x))) 2 

k,l=l 

and call it the dominance variance. Then we get 

(14) a 2 (x)=a 2 ,(x)+ ( T 2 4 (x). 

The variance a 2 A (x) is that component of the total variance in absolute fitness of 
the genotypes which can be explained additively by the alleles constituting those 
genotypes, [2]. Indeed, we can easily check that 

K 

(15) a 2 4 (x) = 2^x fc K(x)-w(x)) 2 . 

fc=i 

The number a\ (x) /2 can be interpreted in terms of the fitness covariance between 
parent and offspring in the updating step (see Ewens, [3], p. 7). 

The residual part a 2 D (x) is that component of a 2 (x) which can be explained by 
the interactions pertaining to dominance between the alleles forming the genotypes. 

Consider now the update of the allelic marginal frequencies x themselves. If we 
first define the frequency-dependent marginal fitness of Ak by Wk (x) = (Wx.) k := 
Y]f Wk^xi, the marginal dynamics is given as in ([4]) by: 



(16) x' k = x k w k (x) =: p k (x) , k = 1, ... 7 K 

-uj(x) 



where now: Wk (x) := is the relative fitness of Ak- In vector form 



x*Wx ~ W(X)X ' 
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where w (x) := Wk (x) , k = 1, K. Again, the mean fitness w (x) , as a Lyapounov 
function, increases as time passes by. We indeed have 



This quantity is half the allelic variance in relative fitness a\ (x) / \2w (x) 
a\ (x) /2. Indeed, 



The mean fitness increase phenomena (either global or partial) occur till the EG 
dynamics reaches an equilibrium state. In the diploid case, this dynamics can 
have more complex equilibrium points, satisfying Wk (x eq ) = w\ (x e(J ), k = 2, K 
and ^2iX eq i = 1. In particular, a stable internal (polymorphic) equilibrium state 
can exist, a necessary and sufficient condition being that W has exactly one strictly 
positive dominant eigenvalue and at least one strictly negative eigenvalue (see King- 
man, H] ) or else that the sequence of principal minors of W alternates in sign. An 
internal polymorphic equilibrium state is asymptotically stable iff it is an isolated 
local maximum of the mean fitness. If this is the case, there is a unique z > 
for which Wz — 1 and the equilibrium polymorphic state is x e(? = z/ |z| . More- 
over, starting from any initial condition in the interior of Sk> all trajectories are 
attracted by this x eg . When there is no such unique globally stable polymorphic 
equilibrium, all trajectories will still converge but perhaps to a local equilibrium 
state where some alleles get extinct. 

Except for the fact that the mean fitness now is a quadratic form in x and that 
the marginal fitness of Ak now is frequency-dependent, depending linearly on x, 
as far as the marginal frequencies are concerned, the updating formalism (|16[) in 
the diploid case looks very similar to the one in Q describing the haploid case. In 
the diploid case, assuming fitnesses are multiplicative, say with Wkj = WkWi, then 
Wk (x) := xm^x = JTmxi an< ^ * ne dynamics l[16p boils down to (HJ). However, the 
mean fitness in this case is w (x) = (y\ wixi) 2 and not wixi as in the haploid 
case. 

Example (selection). In general the whole fitness matrix is unknown. In some 
cases, only one selection parameter s is to be determined (estimated from data). 
Assume indeed w^j = 1 + sa^^i where a^j — aik G [0, 1] are known and s > — 1. 
A natural choice could be au,i = ^rf"^r> with a\.\ = 1 and o,k,k = 0. Or else: 





Its partial rate of increase due to frequency shifts only is 

A P w (x) _ Y,k & x kWk (x) 
w (x) w (x) 




(17) 
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ak.i = (K/k — 1) (K/l — 1) / (K — l) 2 . The simple popular model ak,i — 5k,l is also 
of wide use in this context (see [3], p. 53 or [7] p. 14). 

Then W = J + sA where A is a known matrix whose k,l— entry is au,i- With 
ak (x) = CLkjXi and a (x) = x*Ax, the EG dynamics reads 

(18) x' k =x k \ + ; ak ^ ,k = l,...,K. 

1 + sa (x) 

The allelic variance in relative fitness reads 

(19) o\ (x) /2 = I " ) 2 K W - « W) 2 • ♦ 
Remarks: 

(i) There is an alternative vectorial representation of the dynamics (|16l) and (fT5| . 
Define the symmetric positive-definite matrix Q (x) with quadratic entries in the 
frequencies: 

Q ( X )fc,Z = X k ( S k,l ~ Xi) . 

Introduce the column vector of the relative fitnesses: w (x) = Wk (x) , k = 1, K 
(with w (x) =: W (x) = \ V log w (x) , half the gradient of the logarithm of mean 
fitness). Then, (|16p may be recast as the gradient- like replicator dynamics: 

(20) Ax = Q (x) w (x) = — ]—rQ (x) Wx = Q (x) (x) , 

to (x) 

with |Ax| = l*Ax = as a result of 1*Q (x) = 0*. Note 

W (x)* Ax = W (x)* Q (x) W (x) > 0. 
In the selection case when wk (x) = 1 + sak (x) , using Q (x) 1 = 0: 

Ax = Q (x) (w (x) - 1) = S Q (x) Ax. ♦ 

1 + sa (x) 

(ii) Although we shall not run into details pertaining to the continuous-time setting, 
let us say a few words on this particular aspect. In continuous-time t > 0, the 
dynamics of Xk := Xk (t) is 

x k = x k (w k (x) - w (x)) , k = 1, K 

where the 'dot' is the time-derivative. The growth rate is driven by the average 
excess in mean fitness Wk (x) — w (x) . Alternatively, the dynamics on Sk is x 
= Q (x) Wx with f t |x| = because |x| = l*x = (1, x) and 1*Q (x) = Q (x) 1 = 0. 

In the special selection case, x = sQ (x) Ax or 

Xk = sx k (a k (x) - a (x)) , k = 1, if. 
In this case, the positive quantity 



k=l 



Xk 

fe=l K fe=l 
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may be viewed as the familiar Fisher information / x := (t) of the frequency 
distribution x, as a discrete probability distribution parameterized by continuous 
time t. One can check that, if the mean fitness is w (x) = 1 + sa (x), then 

w (x) = sa (x) = 2/ x . 

So the time derivative of w (x) coincides with twice this Fisher information. This 
constitutes the diploid continuous- time version of ([6]). 

Defining the dimensionless parameter 6 :— st and looking at the time-changed 
frequencies ir k (0) := x k [0/s), we get the s— free dynamics 

TTfc = TTfe (a/t (tt) - a (tt)) , A; = 1, ...,K, 

where the 'dot' now is the derivative with respect to 9. Clearly w (tt) = a (tt) = 
2I n {6) > 0. ♦ 

Partial change of mean fitness. Let us return to the joint EG dynamics where 
no hypothesis on mating was made and consider the full mean fitness 

(21) w(X) = y^gfc.itUfc.i- 

k,i 

Define a k = w k (X) - w (X) where w k (X) := J2i Xk,iw k ,i/x k and x k := J2i x k,i is 
the marginal frequency of A k . Replace the expression (|2Tj) by the equally correct 

(22) w(X) = J2 x k: i (w (X) + a k + ai) , 

fe,i 

suggesting that the fitnesses of the genotypes A k Ai would rather be wj?) := w (X)+ 
a k + on. 

Define the partial change, say Ap, of mean fitness as 
Apw(X) :=J2 Ax ^i ( w ( X ) 

+ a k + ai) 

k,l 

where only a variation in the frequency term is considered. After some elementary 
algebra, we get 

A P w (X) = ^2 ^ x k,i (a k + m) = 2 ^ a k X! ^ Xk < 1 

k,l k I 

= 2^a fc Ax fe =:g\ (X) /w (X) , 

k 

leading to a partial rate of increase Apw (X) jw (X) = a\ (X) which is similar to 
(|17p . In the alternative Castilloux-Lessard interpretation of this phenomenon, pQ , 
defining Ax k a } := Ax k j and observing 

A P w(X) = J2 A ^i w k,l, 

k,l 

the partial change involves an allele-based modification of the genotype frequencies 
while the genotype fitnesses w kt i are kept unchanged. 
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2.3. Estimation of s. We now switch to the announced estimation of s problem 
which seems to be new. 

Assume the current and updated frequencies x and x' are being observed at some 
times t, t + 1. We wish to use this information to estimate the unknown value of 
s. Let first s k be the estimate of s which explains the observable (x;x' k ) at best. 
Clearly, from |HD, 

(23) st- r[ ~ n ' 



ak (x) x k - x' k a (x) 

does the job. A natural estimator s* = s* (x;x') of s which explains best the 
observable (x; x') is: 

s* = argmin > Xkx' k (s — st 



which is: 

(24) s* = 1 ^2x k x' k s* k , 

a weighted average of the s£ attributing more credit to s k when XkX k is largest. 

Sometimes, the Xk, x' k are not directly observed. Rather, what is observed is the 
scalar output: 

y := h (x) := 2J x k h k (x) 

k 

for some known family of measurements hk (x), k = 1,...,K given the process x 
is in state k. Simple but important examples are hk (x) = x k -1 (a > 1) in which 
case, y = h (x) = J^k x k IS a— homozygosity (typically a = 2), or hk (x) = — logx^ 
in which case, y — h (x) = — ^2 k Xk logXfe is the Shannon entropy of the frequency 
distribution. 

Let k be a discrete random variable with P (k = k) = Xk, k = 1, ... 7 K so that 
y := E (h K (x)) , the mathematical expectation with respect to ks law. From (p~8|) . 
we have: 

y' = h (x) := 2J ^/c^-fe (x') = Ax k hk (x') + 2J ^fc^fc (x') 

k k k 



k 



x k h k (x') (a* (x) - a (x)) + E (h K (x')) 



1 + so (x) ^ ' 

Cov(/i K (x'),a K (x))+E(/i K (x')). 



1 + sa (x) 

Therefore, the observed shift in the measurement is 

(25) Ay = - - Cov (/i K (x') , a K (x)) + E (A/i K (x)) 

1 + sa (x) 

and an estimate s* based on Ay can immediately be written down by mere solving 
the above equation (j25|) which is reminiscent of a Price equation (see [4]). It involves 
two terms, one which is related to the correlation between the measurement at t + 1 
and the fitness function at t that has to do with the frequency shift only, another 
related to the induced change of the character value only. 
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3. EG THEORY: THE STOCHASTIC POINT OF VIEW 

With w k (x) := let p k (x) := x k w k (x), k = 1, ...,K with J2 k Pk (x) = 1 be 

defined as in the previous Section cither from allelic or genotypic fitnesses. We 
shall assume throughout that the special selection model assumptions: w k (x) = 
f + sa k (x) and w (x) = 1 + sa (x) are at stake. 



3.1. The Wright-Fisher model. We start considering similar problems under 
the Wright-Fisher model. 



The Model and first properties. Consider an allelic population with constant 
size N. In the haploid (diploid) case, N is (twice) the number of real individuals. 
Let i := i k and i' :— i' k , k — l,...,K be two vectors of integers quantifying the 
size of the allelic populations at two consecutive generations t and t + 1. With 
|i| = J2 k ik, therefore |-^| = = 1 on Sk ■ Suppose the stochastic EG dynamics 
now is given by a Markov chain whose one-step transition matrix P from states 
I = i to I' = i' is given by the multinomial Wright-Fisher (WF) model 

(26) P(i; +1 = i' I I t = i) =: P(i,i') = *.,J f[ Pk (1 

The state-space dimension of this Markov chain is (^*) (the number of composi- 
tions of integer N into K non-negative parts). 

Let e; be the K— null vector except for its I— th entry which is 1. The extremal states 
S* K := {ij* := Nei,l = 1,...,K}, are all absorbing for this Markov chain because 

p k = $k,l ■ Under our assumptions, the chain is not recurrent. Depending on 

the initial condition, say io, the chain will necessarily end up in one of the extremal 
states i;*, with some probability, say tti (io), which can be computed as follows. Let 
7T; := 7Tj (i), i G Sk be an harmonic function of the WF Markov chain, solution to: 

(27) (P - /) m = if i G S K \S* K and m = 1 (i = iVe ; ) if i G S* K . 
It satisfies 

P (I r = iVe ; | I = i ) = ^ (io) , 

where r (< oo almost surely) is the random hitting time of S K for I t and the tt 1 (i )s 
are normalized so as ^ 7r/ (i ) = 1. Thus iri (i ) is the searched probability to end 
up in state Nei starting from state io- In the same vein, the expected hitting time 
a (io) := Ei (r) solves: 

(P-I)a - l,i/NeS K \S* K 
a = 0, i/N e S* K 

where a := a (i), i G Sk- With 7T/ the solution to the above Dirichlet problem, the 
equilibrium measure of the chain therefore is: 

K 

i=i 
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which depends on io. Unless some prior information on io is given, we may assume 
that io = -j^l in which case one expects ni (io) = l/K and Tt eq is uniform on the 
extremal states. 

Necessarily, one allele will fixate and there is no polymorphic equilibrium state even 
when dealing with diploid populations. Which allele and with what probability will 
depend on the initial condition. Thanks to fluctuations, the picture therefore looks 
very different from the one pertaining to the deterministic theory. For analogies of 
this construction with statistical physics, see [10], [9]. 

The marginal transition matrix from i to I' k = i' k is binomial bin(iV,f>fe (^)): 

With p k (jf) := j^Wk (jf), given I = i, the fc— th component I' k of the updated 
state is now random with: 

Ei (I'k) = Np k (±) and a? (I' k ) = N Pk (±\ fl Pk (± 



Mean fitness. We shall introduce the random increment in absolute mean fitness 
as 



,28, A W (^ := £(i_i)4_L) = ,£(fi_| 



k=l 



Ik 

N N 



N 



k=l 



N N 



ak[ N 



Dropping for notational case the argument appearing in Awy , a kl a and p k , we 

get 



K 



it fl + sa k 



k=l 



N V 1 + sa 



- 1 Ofc = 



2 K 
S \ - Ik 



(29) 



1 + sa 



K 



E l k 2 
N a * 



N 



1 + sa N 

k=l 



> 0. 



The mean of the increment of the random absolute mean fitness is positive (a 
random version of the FTNS). Its rate of increase is 

K 



(30) 



1 + sa 



E 

,fe=i 



% k 2 2 
T7 a k - a 

N k 



involving the variance of the a k s under the current frequency distribution k = 
l,...,K. 

Let us now compute the variance of Awy (jj)- We get: 



(31) 



of (a«*> = (e - «) - -v (e 



A" 



A' 



, k,k' = l 



N 2 



a k a k > 



\k=l 
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It is proportional to the variance of the weighted outcomes X^fcLi JT ak gi ven i- 
Using Ei (I' k r k ,) = N (N - 1) PkPk , and E, (l' k 2 ) = N Pk + N {N - 1) p 2 k , we get 

\ 2 



a\ (Aw I ,) = s 2 
and so. 



K 

E 

k,k' = l 



N(N-l) PkPk , 



N 2 



(32) 



of Aw r 



N 



s 

N 



-aka k > 



K 



K / K > 

\fc=i / 



fe=i 



A" 



fe=l 



\fc=l 



jk_ 1 + Sdk U I 7^ 



again involving the variance of the a k s but now under the updated mean frequency 
distribution pk = E» (^J = ^ 

We conclude that 
(33) 

w {w) 

as one might have expected from the analogies with the deterministic theory. 

In fact, the full law of Awy (-^) can be computed and the large N population limit 
is worth investigating. Indeed, its Laplace-Stieltjes transform (LST) reads 



* of ( Aw v [ - 



Ei (er AAwi '(*r)) = e Asa Ei C 



A 



N 



As <r^K ji 



H"Pk 

\k=l 



e N 



(a k ~a) 



suggesting from large deviation theory that, if i k := [Nx k \ , k = 1, K 



\ N J at 



K 



An'ii { N * ) a A' Awv (x) = s^p k (x) (a k (x) - a (x)) 



k=l 



K 



1 + sa (x) 



^2x k a k (x) 2 - a(x) 2 , 



\k=l 



which is the deterministic value u\ (x) / (2 (1 + sa (x))) of the marginal determin- 
istic theory (JT5J) . 



Statistics. We now suppose the WF Markov chain is in state i, with i ^ i* so that 
it has not yet reached any of its equilibrium states. Based on the observation i, we 
would like to design estimators of the selection parameter s. 

The log-likelihood of the model (pM)) is 

K 



N 



logP(i,i') = log ., + log 



N Wk ( N J ^ W ( N 



If w k = 1 + sa k and w = 1 + sa, its derivative with respect to s is 



A 



5 s logP(i,i') =d s > ^ i' fe log (1 + sa k ) -N log (1 + so) 



v.k=l 
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K 



The value s 



MLE 



^MLE 



fc=i 



\N> N J 



ak 

+ sa k 



N- 



1 + sa' 



for which <9 S logP (i, i') = is the Maximum 

Likelihood Estimator of s given the observable (i; i') . It is given by the implicit 
equation 



K 



(34) 



Eth. _ 
N 1 



life A? 



fc=l 



+ s 



MLB 



«fc (£) 



1 + S 



MLE 



It is probably biased. Let us compute the Fisher information on s enclosed in the 
observation (i; I') which is 



(35) 
We get 



/i(*) = Ei (0 s io g p(i,i')r 



7 i ( S ) = ^P(i,i') 



K 



Ofe 



.fc=l 
ak 



Na 



+ sa k 1 + sa 



Na 
+ sa 



and therefore 
(36) 



N 



1 + sa 



Lfe=l 



+ safe 1 + sa 



The Fisher information is exactly the variance of the weighted outcomes ^2k=i~N i+sa k 
given i. We conclude that 



(37) 



E 



iAm ' ^Ki i{s) . 



From the expression of the mean Ei (I' k ) 



E ; 



i k 1 + sa fe 



iV 1 + sa 

and, with {i' k ) i = — J2m=i *fc m ^ nc empirical average given i of I' k based on a size-n 
sample observation of I' k 



Sk afe(^)«fc-(«'fe) i a(i7) 
is a first moment estimator of s explaining best (i;i' k m ,m= 1, nj and 



(38) 



s * = ^ rrr~ y^*fc s t, 



a moment estimator of s explaining best (i; i' m , m = 1, n) whenever we are in 
possession of n observed copies i' of I' based on the same i. This estimator is also 
biased. 
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(39) 



i?i (s, s) 



(s-sf = a? (s) + (Ei (S) - sY 



be the quadratic risk function associated with the estimator s. By the Frechet- 
Darmois-Cramer Rao theorem, we have 



(40) 



Ri (s, s) > 



1 



where, in this classical interpretation of the Fisher information, I\ (s) appears as 
a universal lower bound of the estimator quadratic error. 



Finally, we would like to stress that these considerations are also relevant in the 
context of another fundamental stochastic model arising in the context of evolu- 
tionary genetics. We shall give some elements of how to proceed with this model 
presenting very different properties. 



3.2. The if— allele Moran model. We now focus on the estimation problem un- 
der the Moran model. 



The Model. Let a, f3 £ {1, K} . In the Moran version of the stochastic evolu- 
tion, given I t = I = i, the only accessible values of I' are the neighboring states: 
•a 3 '■— 1 + d a ,B where d a ^ — (0, ..,0, — 1, 0, 1, 0, 0) . Here —1 is in position a 
and 1 in position [3 ^ a corresponding to the transfer of an individual from cell 
a to cell (3. The Moran stochastic EG dynamics now is given by a Markov chain 
whose one-step transition matrix P from states I = i to I' = i' is: 

(41) P (T+i = i' | I t = i) = if iV 4,/3 and 

P (I t+1 = | I t = i) =: p(i ! i^) = ^ (1), 
where pp (±) is given by p 8 (i) := l -§w B 

Summing P fi, over a, (3, (3 ^ a in (|4"Tj) . we get the holding probability 

completing the characterization of the if— allele Moran model. The probability 
that in a one-step transition, the size of allele A a population shrinks of one unit is: 

The probability that in a one-step transition, the size of allele A@ population un- 
dergoes a one unit growth is: 
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As a nearest-neighbor random walk model, the Moran model has a much simpler 
transition matrix P of the Jacobi type. The equilibrium measure of the chain again 
is: 



K 



(42) TT eq :=^2n(i )Si V 



1=1 



where 7T; again solves the Dirichlet problem (f2T)) but with this new simpler Jacobi P. 

In what follows, we assume the special one-parameter selection model leading to: 

_ ip 1 + sap 



N J N 1 + sa 



Mean fitness. Let us compute the LST of J2k a kl'h m the context of a Moran 
model. We get the factorized form: 

E,( e -*IWi)= ]T e- A &«»<^Wp(i i y +e -^»«A^ip / , 



-*J2 k a k i k I ^2 e -H^-a a )l2. na Hi 
K a,0:a^0 



^ AT ^ N' 





7v TV ^ N 

13 



Recalling Awr (-^) = s^tLi (at ~ lv) this leads in particular to (compare 
with PS])): 



(43) EiCA^O = £ (e, -£>«*) = ^ (w" |) 



fc=l / fe=l / \ /3 



w(1+80)l i:M- 2 i " 
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The variance of Au>p (-h) could easily be computed. Using the above result, we 
indeed get: 

Ei (e- £, Ai/ 9 = e- S, a,*, 2 ^ £ e -A, p/Jj 

giving the joint LST of I' given I = i. Putting A; = if / ^ k, the k th — marginal 
reads: 



Ei ^e- AfcI *J = e~ A ^ ^1 - ^ + e x - ^ J (l -p fc + e^Pfc) 

which is of the random walk type. Indeed, we get: Pi (I' k = i' k ) = if i' k ^ ik ± 1 or 

ifc 7^ ife and 

Pi(4 = i fe ) = (i-|) (i-p fc ) + |p* 

Pi = ife + 1) = (l - p fe ; Pi (I' k =i k -l)= l ± (1 - Pfc ) . 
In the special one-parameter case, we have 

Ei (4) = *k + Pfc - T7 = Ik + ITT 



AT/ AT V 1 + sa 

Using previously introduced notations, this gives a first moment estimator of s 
explaining best (\;i' k m , m = 1, n\ as: 

(44) 4 = "(ft'--*). 

«fc (a* - a) 



4. Concluding Remarks 



In this Note our concern has been to introduce the general formalism of evolutionary 
genetics dynamics under fitness, in both the deterministic and stochastic setups, 
and chiefly in discrete-time. In the stochastic version of the problem, both the 
Wright-Fisher and the Moran models were considered. In the process, we revisited 
the various facets of the famous Fisher theorem of natural selection in both the de- 
terministic and stochastic formulations. For the sake of simplicity of the exposition, 
we limited ourselves to a simplified one-parameter model where the sole selection 
parameter is unknown. Using these preliminary results and facts, we discussed 
the estimation problems of the selection parameter based on a single-generation 
frequency distribution shift under both deterministic and stochastic evolutionary 
dynamics. To the best of the author's knowledge, this particular way to address 
the estimation problem is new. It was stressed that in our models, there were no 
mutation effects included. We plan to include these effects in a forthcoming work. 
When mutations are present, the situation changes drastically. Firstly, in the de- 
terministic formulation, the replicator dynamics combining fitness and mutations 
no longer is gradient-like in general. Still, an internal equilibrium point can exist 
were fitnesses to be multiplicative or would the mutation rates satisfy a house of 
cards condition [7]. Secondly, in the stochastic formulation, the Markov chains 
under study (either Wright-Fisher or Moran) are ergodic, now with an invariant 
measure which is independent of the initial condition. This also changes the picture 
drastically. 
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